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(57) The invention provided for a apparatus and 
method for improving the input/output performance of a 
computer system under the control of a multi-tasking, 
multi-threaded operating system. In particular, the in- 
vention provides an apparatus and method to chain con- 
tiguous DMA scatter gather sub blocks of a physical re- 
gion descriptor table for channel 0 with contiguous DMA 
scatter gather sub blocks of physical region descriptor 
table tor channel 1 , using a single data manager, while 
maintaining maximum media bandwidth. DMA block 
transfers are scheduled based on the availability of data 
from the buffer memory of input/output devices (1 27, 
129), thus minimizing both media or network idle time 
as well as minimizing input/output bus idle time. Near 
maximum aggregate bandwidth ot multiple input/output 
buses and their associated devices is then obtained, 
and the apparatus and method thus provides significant 
performance advantages over prior techniques having 
two Input/output channel systems implemented with a 
single data manager. 
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Description 

The present invention relates to a method and ap- 
paratus for controlling (n+i) I/O channels with (n) data 
managers in a homogeneous software programming 
environment. 

In particular the invention can relate to computer system 
input output (I/O) operations, and more particularly to 
an apparatus and method for improving I/O perform- 
ance of a multiple I/O channel computer system at min- 
imal cost. 

In a particular embodiment, the invention is intended for 
use in computer systems controlled by a multiple task- 
ing, multiple threaded operating system such as Win- 
dows NT, OS/2, System 7.5, Novell or UNIX, but can 
also provide significant cost advantages in computer 
systems controlled by single threaded and/or single 
tasking operating systems such as Windows or DOS. 

A typical computer system generally includes one 
or more central processing units (CPUs), main memory 
and one or more I/O channels. The functions of the I/O 
channels are to transfer data between the CPU's main 
memory and either storage units or network interface 
devices. Storage units store data and programs which 
the CPU uses in performing specific programming 
tasks. Typical storage units include hard disk drives, 
CDROMs and tape drives. Network interface devices al- 
low the transfer of data to or from other computer sys- 
tems on the network. Typical network interfaces include 
ethernet. fiber channel. ATM and FDDI. 

Recently, applications run by the CPU have moved 
towards using the high-perfomnance processing power 
of microprocessors (such as the Intel Pentium. Power- 
PC and MIPs R4400) to generate real-time, full motion 
video. To support these applications, substantially more 
I/O bandwidth is required. Typically, full motion video 
windows are limited in size to a fraction of the video mon- 
itor size due to limitations in I/O bandwidth and or limi- 
tations in the processing power of the CPU or video 
card. Users operating computer systems in multi-win- 
dow, graphics intensive environments expect instant re- 
sponse to mouse clicks or commands. Mouse clicks on 
an ICON representing a large text file or application 
should result in immediate visual results, whether or not 
the selected file resides on an I/O channel, or whether 
or not an I/O device is currently transferring large blocks 
of full motion video data. This instant response expec- 
tation, when coupled with the high-bandwidth transfer 
requirement, dictates that future I/O controllers allow 1/ 
O requests to be preempted and rescheduled at a later 
time. 

More recently, new techniques have been proposed 
for providing low-cost, high-performance I/O in the per- 
sonal computer (PC) area. Enhanced IDE disk drives 
having large buffers, capable of interfacing to IDE chan- 
nels having over sixteen megabyte per second DMA 
bandwidth, have been demonstrated and produced, and 
will likely become mainstream in the PC industry over 



the next few years. Brad Hosier has proposed an IDE 
interface specification for improving system perform- 
ance in multitasking environments, called "A Program- 
ming Interface for Bus Master IDE Controllers", which 

5 assumes the capabilities inherent in the DMA capable 
IDE disk drives mentioned earlier. Hosier's proposal 
specifies a simple scatter/gather mechanism which al- 
lows large or small blocks of I/O transfer data to be scat- 
tered to or gathered from main memory. This mecha- 

10 nism cuts down on the number of processor internjpts 
required and the total number of I/O device interactions 
with the CPU when transferring large blocks of data, 
such as that required to support, full motion video. Al- 
though the scatter gather programming interface spec- 

15 ifications were originally intended specifically for con- 
trolling hard disk drives on an IDE channel, software 
drivers can be written to access other types of storage 
devices on the IDE channel, storage devices on other 
types of I/O channels or network interface devices using 

20 the same homogeneous scatter gather programming in- 
terface. 

Fig.1 shows a known apparatus used to perform 
bus master scatter gather DMA operations. An I/O con- 
trol device 20 contains one or more descriptor table 

25 pointer registers 22, a data manager 24, an I/O interface 
26 and a CPU interface 28. Each data manager 24 con- 
tains (i) a dedicated DMA engine for managing bus mas- 
ter data transfers between the I/O device (not shown) 
and main memory 30, and (ii) buffer memory required 

30 for maintaining maximum data transfer bandwidth be- 
tween the I/O channel or interface bus 32 and the proc- 
essor interface 28. The I/O control device 20 (also 
known as an I/O controller) connects to the CPU 34 and 
main memory 30 through a bridge 36, via an interface 

35 21 - The bridge 36 may simply provide an extension of 
the processor's bus, or may buffer and extend the proc- 
essor bus using an entirely different bus structure and 
protocol such as PCI. These types of bridges are com- 
monly known in the art. 

40 Besides containing CPU program and data storage, 
main memory 30 contains one or more physical region 
descriptor tables 37 having entries 38. The main mem- 
ory 30 allows storage of blocks of DMA data 40 trans- 
ferred to or from the I/O devices on the I/O channel or 

45 interface bus 32. Data regions 40 represent blocks of 
linear DMA transfers of contiguous double words of da- 
ta, as specified by a respective entry 38 In the physical 
region descriptor table 37. 

Fig 2 shows a prior method used for perfomning I/O 

50 operations. At step 42, the CPU prepares a physical re- 
gion descriptor (PRD) table in main memory. Each PRD 
table entry is 8 bytes tong and consists of an address 
pointer to the starting address and the transfer size of 
the memory buffer to be transferred. The PRD table may 

55 contain up to 81 g2 PRD table entries, with each entry 
specifying a single, contiguous block of data to be trans- 
ferred between the I/O channel device and main mem-, 
ory. Sequential PRD table entries may specify that a 
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block of data be transferred to or from any available lo- 
cation in main memory. 

At step 44, the CPU writes the starting address of 
the main memory physical region descriptor table to the 
i/O control device's descriptor table pointer register. The 
CPU writes the I/O controller's DMA transfer direction 
bit and clears appropriate I/O controller interrupt bits 
and error flags resulting from previous transfers. 

At step 46, the CPU issues the appropriate Df\/1A 
command to the I/O device. 

At step 48, the CPU writes a start scatter gather 
command to the I/O controller. 

At steps 50, 52 and 54, the I/O controller performs 
a bus master scatter gather DMA operation by sequen- 
tially performing all contiguous DMA block transfers be- 
tween the I/O device and main memory as specified by 
the complete set of entries within a PRD table. 

At step 56, the I/O device signals that the requested 
data transfer Is complete. The I/O controller provides 
signals the processor when allot the requested data has 
travelled through the I/O controller's internal FIFOs and 
has been written successfully to either the main memory 
or the I/O device. 

The I/O controller is usable for the next bus master 
DMA operation. 

Disadvantages do however arise in the known 
methods and apparatus. The apparatus previously de- 
scribed in Fig. 1 may be extended to allow multiple I/O 
channel bus master scatter gather operations. Clearly, 
by adding additional physical region descriptor (PRD) 
tables, "n" dedicated PRD tables may exist, one for each 
I/O channel to be managed by the I/O control device, 
regardless of each I/O channel's protocol and function; 
so long as the I/O channel and it's associated I/O devic- 
es are capable of DMA transfers. Furthermore, from one 
to "n" data managers may exist, whose function is to 
transfer data between the I/O devices residing on the 
"n" I/O channels and main memory. Prior art implemen- 
tations for performing these data transfers typically ded- 
icate a single data manager to each I/O channel be- 
cause the methods used to control the data managers 
are not successful in maintaining maximum possible 1/ 
O bandwidth between the I/O devices and main memo- 
ry. 

Fig. 3 shows single and dual data manager perfomn- 
ance scenarios for a dual channel scatter gather I/O 
controller using the prior apparatus and methods just 
described. It should be noted that the I/O channel is idle 
when the active I/O device's buffer has been emptied by 
IhsM/p channel DMA. As yyil[ be appreciated, the I/O 
channels shown in Fig. 3 perfoma at a bandwidth of twice 
the media rate of the I/O device. 

Historically, I/O devices, due to the nature of their 
technology and their mechanical or network delays, are 
able to sustain data transfers at only 1/16 to 1/2 of the 
I/O bus bandwidth. Most I/O devices contain buffer 
memories (32 to 256 K bytes in size) to compensate for 
their low media or network bandwidths. Thus, once the 



slow media rate has partially filled the devices buffer 
memory, I/O bus transfers may progress at data rates 
dictated by the faster bus bandwidth rather than rates 
dictated by the slower media or network performance. 

5 However, once the buffer space has been exhausted 
due to a transfer which is larger than the buffer size, 
transfer bandwidth reverts back to the media or network 
bandwidth. It should also be noted in Fig 3 that the prior 
method of performing DMAs does not efficiently utility 1/ 

70 o device buffer memory during scatter gather opera- 
tions. In fact, substantial media idle time and I/O bus idle 
time is incurred when using prior art methods with a sin- 
gle data manager and two I/O channels. The single data 
manager mechanism compromises the aggregate I/O 

15 performance by requiring the completion of and I/O 
channel's PRD table prior to starting a bus master op- 
eration on the alternate channel. Dual data manager 1/ 
O controllers overcome these problems, but are expen- 
sive to implement due to their large FIFOs and complex 

20 DMA engines. 

In spite of numerous recent I/O related develop- 
ments and proposals in the PC industry, additional ca- 
pabilities are needed for achieving maximum I/O per- 
formance in multi-tasking, multi-threaded operating sys- 

2S tems, at reasonable cost. 

The present invention seeks to provide a data 
processing method and apparatus having advantage 
over known apparatus and method. 

According to one aspect of the present invention 

30 there is provided a method of transferring blocks of data 
from a plurality of input/output devices in a data process- 
ing system, characterised by the step of 

scheduling DMA block transfers based on the 
availability of data from the plurality of input/output de- 

35 vices. 

According to another aspect of the present inven- 
tion there is provided data processing apparatus com- 
prising central processing means, memory means, in- 
put/output controller means coupled between said cen- 

40 tral processing means and said memory means, and 
characterised by a plurality of input/output devices cou- 
pled to said input/output controller means, said input/ 
output devices each having a buffer, wherein said input/ 
output controller comprises means for transferring data 

45 from said input/output device to said memory means 
based upon availability of data in at least one of said 
buffers. 

According to a further aspect of the present inven- 
tion there is provided a method of performing DMA data 

50 transfers in a data processing system, charactered by 
the steps of interleaving blocks of data based upon lo- 
cations of scatter gather physical region descriptor en- 
tries maintained in a table. 

The present invention therefore advantageously 

55 provides an apparatus and method for improving the in- 
put/output performance of a computer system under the 
control of a multi-tasking, multi-threaded operating sys- 
tem. In particular, the invention provides an apparatus 
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and method to chain contiguous DMA scatter gather sub 
blocks of a PRD table for channel 0 with contiguous 
DMA scatter gather sub blocks of a PRD table for chan- 
nel 1. using a single data manager, while maintaining 
maximum media bandwidth. DMA block transfers are 
scheduled based on the availability of data from the I/O 
device's buffer memory, thus minimizing both media and 
network idle time as well as minimizing I/O buses and 
their associated devbes is obtained. The apparatus and 
method thus provides significant performance advan- 
tages over prior techniques having two I/O channel sys- 
tems implemented with a single data manager. 

The present invention can therefore provide an ap- 
paratus and method for further improving computer sys- 
tems I/O performance at minimal cost. 

The apparatus and method provides a means for 
preempting outstanding and in-progress DMA requests 
to an I/O device on a given I/O channel when a higher 
priority request is encountered for a device on the same 
channel or for the same device. 

The present invention is advantageous in providing 
an improved data processing system. 

Also, the present invention can provide an improved 
I/O operation in a data system, and provide improved 
DMA operations in a data processing system. 

Further, the present invention can advantageously 
also provide improved DMA perfornnance. at minimal in- 
cremental cost, in a data processing system. 

In a homogeneous programming Interface environ- 
ment as described prevbusly, the invention can be read- 
ily extended to control 'n+i' I/O channels using "n" data 
managers. 

Further, the apparatus can provide a performance 
efficient method for software drivers to make use of 
preemptive scheduling techniques found in newer multi- 
tasking, multi-threaded operating systems. Preemptive 
support lessens the response time of performance crit- 
ical I/O requests. 

The invention is described further hereinafter, by 
way of example only, with reference to the accompany- 
ing drawings in which: 

Fig. 4 is a block diagram of an apparatus for per- 
forming single data manager controlled bus master 
scatter gather DMA operations in a two I/O channel 
system; 

Fig. 5 is a diagram of the method for performing sin- 
gle data manager controlled bus master scatter 
gather DMA operations in a two I/O channel sys- 
tems; 

Fig. 6 illustrates a method of merging concurrent in- 
terrupts; 

Fig. 7 illustrates a method of arbitration and selec- 
tion by a data nnanager. 



Fig. B contains performance analysis diagrams for 
single data manager, dual I/O channel operations; 

Fig. 9 contains diagrams showing possible modes 
s of operation supported by the current invention; 

Fig. 1 0 is a block diagram of particular operating en- 
vironment of the present invention, using a PCI/Du- 
al IDE I/O channel control device; and 

10 

Fig. 11 is a block diagram of an alternate operating 
environment of the present invention, using a PCT/ 
Dual IDE/ethernet I/O channel control device. 

15 Fig. 4 illustrates a particular apparatus embodying 
the present invention. An input/output control device 
120 contains a single data manager 124 and I/O inter- 
face logic 126 which interfaces to two IDE I/O channels 
127 and 1 29. A main memory 1 30 contains two physical 

20 region descriptor tables 1 37 and 1 39 for controlling bus 
master DMA transfers on bus/channel 0 and bus/chan- 
nel 1 . A single PRD table is dedicated to each I/O chan- 
nel, as shown at 121 and 123. The data manager 124 
contains the necessary logic to chain DMA scatter gath- 

2S er blocks specified by PRD table entries between chan- 
nels and between PRD tables in an altemating fashion. 
The data manager also contains the necessary logic to 
detemnine whether or not a target device on either chan- 
nel contains DMA data ready for transfer PRD tables 

30 1 37 and 139 are not "swapped" by the data manager 
unless the alternate channel is prepared to transfer da- 
ta. 

Fig. 5 illustrates a particular method embodying the 
present invention and used for perfomning I/O opera- 

35 tions. At steps 1 42 and 1 43, the CPU prepares a phys- 
ical region descriptor (PRD) table in main memory. Each 
PRD table entry is 8 bytes long and consists of an ad- 
dress pointer to the starting address and the transfer 
count of the memory buffer to be transferred. The PRD 

40 table may contain up to 8192 PRD table entries, with 
each entry specifying a single, contiguous block of data 
to be transferred between the I/O channel device and 
main memory. Sequential PRD table entries may specify 
that a block of data be transferred to or from any avail- 

45 able location in memory. The two PRD tables may be 
prepared concurrently or sequentially. 

At steps 144 and 145, the CPU writes the starting 
address of the main memory physical region descriptor 
table to the I/O control device's descriptor table pointer 

50 register The CPU writes the !/p cqntroljer's pMA trans- 
fer direction bit and clears appropriate I/O controller in- 
terrupt bits and error flags resulting from previous trans- 
fers. The CPU nriay write either descriptor table pointer 
register at any time so long as the table to be written is 

55 not an active bus master 

At step 146 or 147 (depending upon which channel 
is being used), the CPU issues the appropriate DMA 
command to an I/O device. A second I/O device on the 
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alternate channel may be issued an additional DMA 
command concurrently with the previously mentioned 1/ 
O device's DMA operation. 

At step 148 or 149 (depending upon which channel 
is being used), the CPU writes a start scatter gather 
command to one channel of the I/O controller. A second 
I/O control devices may be issued the start scatter gath- 
er command concurrently with the alternate channel's 
bus master operation. 

As shown in the remaining steps of Fig. 5, the I/O 
controller performs a bus master DMA operation on the 
first contiguous DMA block transfer available from either 
active I/O device, as specified by the associated PRD 
table entry for that channel. Following the above contig- 
uous DMA block transfer, the data manager transfers a 
block f ronn the alternate channel if data is available from 
the I/O device on that channel. If data is not available 
from the alternate channel, and data Is available from 
the original channel, the data manager transfers a block 
from the original channel. Transfers continue in this 
manner so long as data is available on one of the two 
channels, and the current PRD tables have not been ex- 
hausted. If neither channel has data available for trans- 
fer, the data manager waits for the first available data 
transfer opportunity. Onco data is available from either 
channel, the alternating channel DMA flow described 
above is restarted. 

At steps 156 or 157 (depending upon which channel 
is being used), the I/O device determines the transfer 
status, then signals that the requested data transfer is 
complete. The I/O controller signals the processor when 
all of the requested data has traveled through the I/O 
controller's internal FIFOs and has been written suc- 
cessfully to either main memory or the I/O device. An 1/ 
O device on the alternate channel will continue opera- 
tion following completion of a PRD table by the original 
channel. 

This particular method provides the capability of 
non-destructively preempting either bus master at any 
time during the transfer process. Two types of preemp- 
tion exist, a preemption during mechanical or network 
delays prior to data transfer across the I/O bus, and 
preemption during active DMA data transfers. The first 
preemption (i.e. prior to data transfer) is detected at the 
Tes* exit point from block 158 of Figure 5. The second 
preemption (i.e. during active DMA data transfers) is de- 
tected at the 'Yes' exit point from blocks 154 and 1 55 of 
Fig. 5. 

During preemption, the process/program clears the 
start scatter gather comnnand from the I/O cpntrol device 
for the preempted channel and evaluates the appropri- 
ate descriptor table pointer. 

If the pointer contains the address of the last PRD 
table entry from a previously executed PRD table, then 
the preemption is of the "prior to DMA' type. The proc- 
ess/program clears the active I/O device command, 
sends a new DMA command to the desired I/O device, 
rewrites the PRD tables, saves the current descriptor 



pointer register, rewrites the descriptor pointer register, 
and sends a new start scatter gather command to the 1/ 
O control device. When the preempting PRD table has 
completed it's transfer, the previously saved descriptor 

5 pointer register is reloaded following the reissuing of the 
DMA command to the preempted I/O device. 

If the pointer contains the address of any PRD table 
entry from the current PRD table, then the preemption 
is of the "during DMA" type. The process/program writes 

70 a "stop after current table entry" command to the I/O 
control device. The data manager will complete the cur- 
rent PRD table entry's DMA, then interrupt the proces- 
sor. 

The process/program clears the active I/O device 
'5 command, sends a new DMA command to the desired 
I/O device, rewrites the PRD tables, saves the current 
descriptor pointer register, rewrites the descriptor point- 
er register, and sends a new start scatter gather com- 
mand to the I/O control device. When the preempting 
20 PRD table has completed its transfer, the previously 
saved descriptor pointer register is reloaded following 
the reissuing of the abbreviated DMA command to the 
preempted I/O device. 

Merging of concurrent interrupts can be achieved 
25 and as previously described, multiple contiguous DMA 
blocks, controlled by PRD table entries in a scatter gath- 
er PRD table, are transferred to or from contiguous 
memory space of a specific I/O device. Following the 
final data transfer controlled by the last entry in a PRD 
30 table, the processor is interrupted either by the I/O de- 
vice or the I/O controller depending on the I/O control- 
ler's current configuration, "n" concurrent data transfer 
operations for multiple I/O devices on multiple I/O chan- 
nels complete their transfers based on time scales that 
35 are not deterministic in nature. This characteristic of 
concurrent I/O channel operation requires a mechanism 
to associate a given interrupt with the responsible IAD 
device, channel and operation. Furthermore, concurrent 
I/O operations could generate multiple, overlapping in- 
40 terrupts requiring either a hardware interrupt queuing 
logic, or complex interrupt handling software for the host 
processor. 

As illustrated in Fig. 6, the present invention can 
control interrupt generation through selective arbitra- 
45 tion. When a scatter gather transfer reaches it's final 
PRD table entry, a processor interrupt is eminent, and 
will occur following the successful completion of data 
transfer controlled by that table entry. Other channels' 
concurrent scatter gather DMA transfers are allowed to 
50 proceed through their PRD table so long as they are not 
executing a final block transfer PRD entry themselves. 
If a second final PRD table entry transfer is encountered, 
its operation is deferred until the first channel's transfer 
completion interrupt has been serviced by the host proc- 
55 essor. If more than two active I/O channels exist, and 
both previous channels have reached last table entry 
status, the third I/O channel will proceed with scatter 
gather DMA operations so long as that channel doesn't 
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encounter an eminent end of table condition prior to the 
CPU servicing anticipated or outstanding previous data 
transfer completion interrupts. 

Thus, the described techniques provide discrete, 
non-overlapping interrupts to signal the completion of s 
each I/O block transfer request to the host processor 
This capability minimizes the complexity and improves 
the performance of the host processor's interrupt han- 
dling software, when managing concurrent 1/0 trans- 
fers. Prior methods for handling concurrent interrupts re- to 
quired interrupt queues or complex interrupt handling 
software running on the host CPU. 

First order data manager selection and arbitration 
is based on the capability of a given I/O device to trans- 
fer data at any given point in time. Typical I/O devices ^5 
contain internal buffers which allow data to be trans- 
ferred to or from the I/O device at maximum I/O channel 
bandwidth rather than at the slower media or network 
bandwidths. Fig. 7 illustrates now data transfer is select- 
ed/enabled. During I/O read operations, the I/O device 20 
requests data transfers to the I/O channel when internal 
memory buffers have been filled to a preconfigured 
"high watermark" levei.e.nsuring that the resulting DMA 
to the I/O channel efficiently utilizes the available I/O bus 
bandwidth. During I/O write operations, the I/O device 2S 
requests data transfers from the I/O channel when in- 
ternal memory buffers are at a "low watermark" level; 
once again allowing efficient utilization of the I/O bus 
bandwidth. First order data manager channel selection 
is based directly on an I/O device's ability to transfer a 30 
block of data at any given point in time. This method is 
efficient due to the selectbn process being controlled 
by actual data availability from the I/O device during 1/ 
O reads, or buffer availability within the targeted I/O de- 
vice during I/O writes. This method is much more cost 35 
efficient to implement because the anticipatory logic, 
bus monitoring logic and complex arbitration logic or 
software or alternative approaches is not required. 

Thus, in summary, first order DMA scheduling is on 
an "as needed" basis depending upon whether or not 40 
the targeted I/O devices have transferred enough data 
from (or to) media or the network to partially fill (or emp- 
ty) the I/O device buffer. The apparatus allows various 
second order arbitration and scheduling techniques to 
be used for scheduling bus master DMA operations 
among "n" data managers for "n+i" I/O channels when 
multiple I/O channels meet the first order constraints for 
bus mastership, while seeking the use of a data man- 
ager which has recently become idle. Well knovm arbi- 
t ration a Igorithms such as round rob in , f i rst com e f i rst 50 
serve, least recently used, most recently used and ran- 
dom can be used for second order arbitration. 

Fig. 8 illustrates that the preferred method de- 
scribed hereinabove efficiently utilizes the buffering ca- 
pabilities of the I/O devices as well as the bandwidth of 55 
the two 1/0 channels. It should be noted that, similar to 
the assumptions given with respect to Fig. 3, the I/O 
channel is idle when the active I/O device's buffer has 




been emptied by the I/O channel DMA. The I/O channels 
illustrated Fig. 6 perform at a bandwidth of twice the me- 
dia rate of the I/O device. 

A comparison with Fig. 3 shows that the single data 
manager solution described here perfomns almost as 
well as the previously described dual data nrtanager so- 
lution, and significantly out performs the previously de- 
scribed single data manager, dual I/O solution. Further- 
more, the process/program can tune the I/O transfer 
performance by modifying the transfer size and number 
of the PRD table entries based on the I/O devices' buffer 
size, media rate and priority. 

Typical operating modes allowed by the present in- 
vention are illustrated in Fig. 9. A single channel opera- 
tion is shown at 200, where Channel 0 transfers blocks 
A, B, C and D, followed by Channel 1 transfer of Blocks 
E, F, G and H. 

Concurrent channel operation is shown at 202, 
where the transfer of blocks A, B, C and D from Channel 
0 are interleaved with the transfer of blocks E, F, G and 
H from Channel 1 . Although individual blocks are not 
transferred concurrently, but rather interieaved, the 
overall data transfer can be viewed as being concurrent. 

Concurrent channel operation with Channel 0 
preemption is shown at 204, where data from another 
device on a particular channel (in this example blocks I 
and J from a device 1 on Channel 0) preempts the trans- 
fer of data from a device on the same channel (in this 
example, blocks A, B, C and D from device 0 on Channel 
0). The blocks are initially transferred in a manner similar 
to that illustrated with the normal concurrent channel op- 
eration of 202. However, prior to transferring Block C, a 
preemption on channel 0 is received. The process pre- 
viously described is then performed, including rewriting 
the descriptor pointer register. Channel 1 then continues 
DMA transfer of its next block of data (block G). Channel 
0 then transfers a block of data (block I) from the 
preemptive second device on Channel 0. Channel 1 
then continues DMA transfer of its next block of data 
(block H). Channel 0 then transfers the next block of da- 
ta (block J) from the preemptive second device on Chan- 
nel 0. As this second device has no more data to trans- 
fer, the original PRD for channel 0 is restored, and data 
transfer from the fi rst device on channel 0 continues with 
the transfer of blocks C and D, 

Concurrent channel operation with Channel 1 
preemption is shown at 206. This operates in an analo- 
gous fashion to that just described with respect to the 
preemptive Channel 0 operation of 204 and so further 
discussion |s unnecessary. 

Fig. 10 illustrates the I/O Control Device 120 used 
in the preferred system environment. As shown, the pre- 
ferred environment is for use in interfacing two channels 
of an IDE bus with a PCI bus. IDE buses and PCI buses 
are well known in the art. The I/O control device 1 20 
allows for a more efficient DMA transfer in such an en- 
vironment, as previously described. 

Fig. 11 illustrates an alternate system environment 
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to that of Fig. 10. The techniques described herein can 
be extended to support tvjo channels of an IDE bus in 
addition to an ethernet channel. In order to achieve such 
expansion, an additional descriptor pointer register 125 
for channel 2 is added. An additional DMA engine 225 
is also added to provide specific DMA interface support 
for the additional channel. This DMA engine would op- 
erate in similar fashion to that illustrated in Figure 5, but 
only having flow control for managing a single channel 
(as opposed to the two channels shown in Fig. 5). 

The above described apparatus and method pro- 
vides a mechanism for performance tuning DMA trans- 
fers between main memory and I/O devices, based on 
the media or network bandwidth and buffer size of the 
I/O device. Slower I/O devices (i.e. CDROMS, tape 
drives or ethernet network devices) are controlled using 
a larger number of scatter gather PRD table entries hav- 
ing a smaller block transfer size. Faster I/O devices (ie,. 
hard disk drives or fiber channel switch fabric ports) are 
controlled using a smaller number of scatter gather PRD 
table entries having a larger block transfer size. Like- 
wise. I/O devices having smaller buffer sizes are con- 
trolled using a larger number of scatter gather PRD table 
entries having a smaller block transfer size. I/O devices 
having larger buffer sizes may be controlled using a 
smaller n umber of scatter gather PRD table entries hav- 
ing a larger block transfer size. Software may easily tune 
the I/O controller to provide balanced, high aggregate 
bandwidth system performance. 

It should be appreciated that the present invention 
is not restrictive to the details of the forgoing embodi- 
ments. 
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IS 



25 



30 



A method of performing DMA data transfers in a da- 
ta processing system, characterised by the step of 
interleaving blocks of data based upon loca- 
tions of scatter gather physical region descriptor en- 
tries maintained In a table. 



Claims 35 

1 . A method of transferring blocks of data from a plu- 
rality of input/output devbes in a data processing 
system, characterised by 

scheduling DMA block transfers based on the avail- 40 
ability of data from the plurality of input/output de- 
vices. 

2. Data processing apparatus comprising 

45 

central processing means (34). 
memory means (130), 

input/output controller means (120) coupled 
between said central processing means (34), 
and characterised by a plurality of mput/output so 
devices (1 27,129) coupled to said input/output 
controller means (120). said input/output devic- 
es ( 1 27, 1 29) each having a buffer, wherein said 
input/output controller comprises means for 
transferring data from said input/output device 55 
(1 27, 1 29) to said memory means (120) based 
upon availability of data in at least one of said 
buffers. 
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